



AP972 02035 

PCT/AU97/00351






REC'D 24 JUN 1997

WIPO PCT

Patent Office 
Canberra 



I, DAVID DANIEL CLARKE, ASSISTANT DIRECTOR PATENT SERVICES,
hereby certify that the annexed are true copies of the Provisional specification and
drawing(s) as filed on 5 June 1996 in connection with Application No. PO 0265 for a
patent by BIOMOLECULAR RESEARCH INSTITUTE LTD filed on 5 June 1996.

I further certify that the annexed documents are not, as yet, open to public inspection. 




PRIORITY DOCUMENT 



WITNESS my hand this Twelfth 
day of June 1997 



DAVID DANIEL CLARKE 

ASSISTANT DIRECTOR PATENT SERVICES 






AUSTRALIAN PATENT OFFICE

PROVISIONAL No. PO 0265    DATE OF FILING 5 JUN. 96



AUSTRALIA 
Patents Act 1990 



PROVISIONAL SPECIFICATION 



Applicant(s): BIOMOLECULAR RESEARCH INSTITUTE LTD
A.C.N. 050 135 012

Invention Title: VIRAL PEPTIDE



The invention is described in the following statement 






VIRAL PEPTIDE 

This invention relates to viruses of the family
Paramyxoviridae, particularly viruses of the respiratory
syncytial virus group. More particularly, the invention
relates to the attachment protein of these viruses, and to the
structure of the region of the attachment protein which is
involved in binding to the cellular receptor for the
virus.

Background of the Invention 

Respiratory syncytial viruses are significant
pathogens of humans and animals throughout the world.
Virtually all humans become infected with human respiratory
syncytial virus (RSV) by two years of age, and repeated
infections occur throughout life. RSV is regarded as the
most serious respiratory pathogen of infants and young
children, but it can also cause serious disease in
immunocompromised adults and in the elderly. Serious cases
of infection manifest as bronchiolitis and pneumonia, and
can be fatal. Estimates of the impact of RSV infection
indicate that it results in 91,000 hospital admissions
annually in the United States of America (Heilman, 1990)
and hospitalisation of 1% of children before the age of 12
months in Britain (Cane and Pringle, 1995). Epidemics of
the virus occur on an annual basis coincidental with other
viruses, such as influenza and parainfluenza.

Natural immunity does not appear to provide
protection against RSV infection (McIntosh and Chanock,
1990; Hall, 1994). Even infants provided with maternal
antibodies are susceptible to RSV infection (McIntosh,
1990). Vaccine development strategies to combat RSV have
not been successful, and in one study a
formalin-inactivated virus vaccine actually exacerbated disease
(McIntosh and Chanock, 1990; Hall, 1994). The only
pharmaceutical agent presently available to treat RSV,



staff/na/keep/S P EC I S/B IOM OL ECU LA R . viral 4.6 




Ribavirin, is expensive and complex to deliver as an
aerosol, and is of questionable efficacy (McIntosh, 1990;
Levin, 1994). Thus it is apparent that a greater
understanding of the infectious mechanism and immunobiology
of RSV is required to develop control measures based on
vaccines or antiviral agents.

RSV belongs to the Pneumovirus genus of the
Paramyxoviridae family of single strand negative sense RNA
viruses, which includes other serious pathogens such as
parainfluenza, mumps and measles (McIntosh, 1990;
Kingsbury, 1990) and the recently identified zoonotic
equine morbillivirus (Murray et al, 1995). Like other
Paramyxoviridae, RSV has two membrane glycoproteins which
mediate invasion of susceptible cells (Morrison and
Portner, 1991). One protein, the large glycoprotein or
G protein, functions in attachment to cells. The other,
the so-called fusion or F protein, causes fusion between
the lipid of the viral membrane envelope and the cell
plasma membrane lipid bilayer. RSV infection can also be
transmitted by fusion of membranes of infected cells, which
have F protein expressed on their surface, with adjacent
cells.

The molecular architecture of the F protein is
conserved between all members of the three genera of the
Paramyxoviridae; however, each genus has a characteristic
attachment protein (Morrison and Portner, 1991). Members
of the Paramyxovirus genus have attachment proteins with
neuraminidase and haemagglutinating activities; the
attachment proteins of the Morbillivirus genus are
haemagglutinins, but lack neuraminidase activity; and the
attachment proteins of Pneumoviruses lack both
haemagglutination and neuraminidase properties. Attachment
proteins of the Paramyxovirus (Morrison and Portner, 1991)
and Morbillivirus (Dore-Duffy and Howe, 1978) genera
participate in sialic acid receptor-type interactions,
which account for their ability to agglutinate red blood




cells. RSV is also reported to interact with sialic acid;
however, the mechanism of RSV G protein attachment and the
identity of the cellular receptor for the G protein are not
yet known (Markwell, 1991).

The Paramyxoviridae F protein is invariably a
type I integral membrane protein, but the attachment
proteins are all type II integral membrane proteins. The
oligosaccharide compositions of the Paramyxovirus and
Morbillivirus attachment proteins are typical of integral
membrane proteins (Morrison and Portner, 1991), but the RSV
attachment protein, also termed the G protein, appears to
contain an unusually high proportion of carbohydrate
(Morrison and Portner, 1991; reviewed by Sullender and
Wertz, 1991). The gene for the RSV strain A2 attachment
protein encodes a potential primary translation product of
298 amino acids with a theoretical Mr of 32588 (Satake et
al, 1985; Wertz et al, 1985), but the protein has an apparent
molecular weight of 80,000-90,000 (Levine, 1977; Gruber and Levine,
1983; Lambert and Pons, 1983) as estimated by
electrophoresis in polyacrylamide gels containing sodium
dodecylsulfate (SDS-PAGE). The unusually high molecular
weight of the RSV attachment protein as measured by
SDS-PAGE has been attributed to a high content of both O-
and N-linked oligosaccharides (Gruber and Levine, 1985;
Lambert, 1988).

It has been suggested that the conserved region
in the central part of the G protein ectodomain may be
involved in ligand interactions with a cellular receptor
for the G protein (Collins, 1991; Johnson et al, 1987);
however, this remains to be demonstrated. Peptide epitope
scanning experiments, using nested sets of synthetic
peptides, have identified the cysteine-containing region as
a subtype-specific antigenic determinant (Norrby et al,
1987) which shows some dependence on oxidation of the
cysteines to cystine (Akerlind-Stopner et al, 1990).
Monoclonal antibodies which react with this region also


block binding of the G protein to cells (Feldman and
Hendry, 1996). However, the disulphide linkage status of
this region remains to be elucidated. In addition, the
glycosylation status of the serine and threonine residues
in the conserved domain and around the conserved cysteine
residues or the Asn-Pro-Thr sequence between cysteines 176
and 182 of the ectodomain has not been determined, and the
sites of disulphide bonds are not known.

We have now found that the four cysteine residues
of the ectodomain form a uniform disulphide bond pattern,
and that the region of the protein around these residues
and the conserved region lack oligosaccharide attachment.
This finding has significance for the design and production
of vaccines, diagnostic reagents, and therapeutic compounds
for human RSV and related viruses.

In particular, the absence of glycosylation
leaves this region of the ectodomain open for receptor
binding. Heretofore, the possibility of attached
oligosaccharide being present in this region has meant that
the design and interpretation of experiments to examine
binding of the virus to its cellular receptor have been
exceedingly difficult, and accurate interpretation almost
impossible, because of the level of variability made
possible by glycosylation.

Summary of the Invention

According to one aspect, the invention provides a
compound having structural homology to a contiguous
sequence of amino acids within the sequence representing
residues 149-197 of the G protein of respiratory syncytial
virus, in which

a) no oligosaccharide is linked to potential
serine, threonine or asparagine attachment sites;

b) four cysteine residues are involved in
disulphide linkages; and




c) the pattern of disulphide linkage is
Cys 173 linked to Cys 186, and Cys 176 linked to Cys 182,

and in which said compound possesses a biological
activity of respiratory syncytial virus G protein.
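
The residue numbering and the stated disulphide pattern can be checked against the strain A2 sequence for residues 149-197 (SEQ ID NO 1). The following is a minimal sketch only; the helper names (`residue_at`, `cysteine_positions`, `check_pattern`) are illustrative assumptions and are not part of the specification.

```python
# Residues 149-197 of the RSV strain A2 G protein (SEQ ID NO 1 in this text).
REGION = "KQRQNKPPSKPNNDFHFEVFNFVPCSICSNNPTCWAICKRIPNKKPGKK"
REGION_START = 149  # residue number of the first character of REGION

# Disulphide pattern as stated in the specification.
DISULPHIDES = [(173, 186), (176, 182)]


def residue_at(position: int) -> str:
    """Return the one-letter code at a given G protein residue number."""
    index = position - REGION_START
    if not 0 <= index < len(REGION):
        raise ValueError(f"residue {position} is outside 149-197")
    return REGION[index]


def cysteine_positions() -> list:
    """Residue numbers of every cysteine in the region."""
    return [REGION_START + i for i, aa in enumerate(REGION) if aa == "C"]


def check_pattern() -> bool:
    """True if each cysteine in the region appears in exactly one pair."""
    paired = sorted(p for pair in DISULPHIDES for p in pair)
    return paired == cysteine_positions()


if __name__ == "__main__":
    print(cysteine_positions())
    print(check_pattern())
```

Under this numbering the Asn-Pro-Thr sequence falls at residues 179-181, between Cys 176 and Cys 182, which agrees with the description elsewhere in the text.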
For the purposes of this specification, a
biological activity of respiratory syncytial virus
G protein is defined as one or more of

a) the ability to bind to one or more
antibodies selected from the group consisting of rabbit
polyclonal antibody directed against RSV, murine monoclonal
antibody directed against RSV, and antibodies present in
human convalescent sera from patients infected with RSV;
and

b) the ability to bind to cells capable of
being infected with RSV.

Preferably the virus is selected from the group
consisting of human RSV subtype A, human RSV subtype B,
bovine RSV, and mutants and variants thereof.

More preferably the compound is a peptide
corresponding to amino acids 158 to 196 of the
RSV G protein. Even more preferably the peptide
corresponds to amino acids 165 to 187 of the RSV G protein.

Most preferably the compound is a peptide having
one of the following amino acid sequences:



SEQ ID NO 1   KQRQNKPPSKPNNDFHFEVFNFVPCSICSNNPTCWAICKRIPNKKPGKK
SEQ ID NO 2   N
SEQ ID NO 3   R
SEQ ID NO 4   H
SEQ ID NO 5   N
SEQ ID NO 6   N
SEQ ID NO 7   N
SEQ ID NO 8   R
SEQ ID NO 9   -S-SKN — K — KD-Y G — QLt-KS T — SN — K —
SEQ ID NO 10  -S-SKN — K — KD-Y G- -QL-KS T — SN — K —
SEQ ID NO 11  -P-PKN — K — KD-Y G — QL-KS ---T — SN — K —
SEQ ID NO 12  -P -LKN — K — KD-Y G- -QL-KS ---T — SN — K--
SEQ ID NO 13  -P-LKN — K — KD-Y G — QL-KS T-SSN — K —
SEQ ID NO 14  -P-LKN — K — KD-Y G — QL.-KS T — SN —
SEQ ID NO 15  -S-SKN — K — KD-Y G — QL.-KS T--SN--K —
SEQ ID NO 16  NPSGSI — ENHQDHNN-QTLPY T-EG-LA-LSL-HIETERA-SRA
SEQ ID NO 17  P T R
SEQ ID NO 18  S R--T



Sequences encompassing conservative substitutions 
of amino acids are within the scope of the invention, 
provided that the biological activity is retained. 

It is to be clearly understood that the compounds
of the invention include peptide analogues, including but
not limited to the following:

1. Compounds in which one or more amino acids
are replaced by the corresponding D-analogue. The skilled
person will be aware that retro-inverso amino acid
sequences can be synthesised by standard methods; see for
example Chorev and Goodman, 1993;

2. Peptidomimetic compounds, in which the
peptide bond is replaced by a structure more resistant to
metabolic degradation. See for example Olson et al, 1993;
and

3. Compounds in which individual amino acids
are replaced by analogous structures, for example
gem-diaminoalkyl groups or alkylmalonyl groups, with or
without modified termini or alkyl, acyl or amine
substitutions to modify their charge.

The use of such alternative structures can
provide significantly longer half-life in the body, since
they are more resistant to breakdown under physiological
conditions.

Because of the biological activity of the
compounds of the invention, they are useful as therapeutic
and diagnostic agents, and are also useful in screening
to identify compounds capable of inhibiting
binding of the virus to its host cell. Therefore the
invention also provides


a) a diagnostic composition comprising a
compound of the invention;

b) a pharmaceutical composition comprising a
compound of the invention together with a pharmaceutically
acceptable carrier;

c) a method of prevention or treatment of
Pneumovirus infection comprising the step of administering
an effective amount of a compound of the invention to a
mammal in need of such treatment; and

d) a method of diagnosis of Pneumovirus
infection, comprising exposing a biological fluid or sample
from a mammal suspected of being infected with said virus
to a compound of the invention, and measuring the
interaction between the compound and said fluid or sample.

Diagnostic kits are also within the scope of the
invention.

Because at least some compounds of the invention
are immunogenic, in a further aspect the invention provides
a method of immunisation against Pneumovirus infection,
comprising the step of immunising a mammal at risk of such
infection with an immunising-effective dose of a compound
of the invention, said compound being immunogenic and
having the ability to elicit protective antibody.

Similarly, even where the antibodies produced in
response to immunisation with a compound of the invention
are not protective, such antibodies will be useful as
diagnostic reagents. For this application of the invention
the only requirement is that the antibodies elicited have
the capacity to interact with a Pneumovirus in a detectable
manner. For example, the antibody can be coupled to a
detectable marker such as a radioactive label, a
fluorescent marker, a luminescent marker or an enzyme
marker. The person skilled in the art will be aware of a
great variety of suitable such markers. Thus both
non-protective and protective antibodies directed against
compounds of the invention are also within the scope of the


invention. 

Compounds according to the invention may be
directly labelled with a detectable marker, such as those
mentioned above for labelling of antibodies, and/or with a
photoaffinity label, and are useful for identification and
structural characterization of the cellular receptor for
RSV and other Pneumoviruses. Knowledge of the structure
of the receptor and the mechanism of its interaction with
the G protein is useful in the design of antiviral
compounds.

The person skilled in the art will be aware that
anti-idiotype antibodies directed against antibodies
according to the invention provide useful structural
information concerning the identity and mechanism of action
of the receptor site for the G protein.

While the invention is described in detail with
reference to human respiratory syncytial virus, it will be
clearly understood that the invention is applicable to the
genus Pneumovirus in general, and particularly to bovine
RSV in addition to human RSV. It will be evident from the
following description that while the sequence of the
G protein of bovine RSV varies to some extent, there is
significant conservation of a specific sequence, and that
the cysteine residues are in the same position as in human
RSV G protein. The conservation of the disulphide
bonding pattern in all strains tested indicates that this
pattern has not varied during evolution of the virus, and
that it is functionally significant.

Brief Description of the Figures

Figure 1A is a diagrammatic representation of the
RSV G protein predicted by gene sequence analysis. The
clear area containing a single cysteine residue (I) is the
cytoplasmic domain, the black region (II) is the
transmembrane domain and subdomains shaded gray (III & V)
are the putative heavily glycosylated regions of the






ectodomain separated by the non-glycosylated disulphide 
subdomain (IV); 

Figure 1B shows the disulphide arrangement
determined in this study to involve pairing of cysteine 173 
with cysteine 186, and cysteine 176 with cysteine 182; 

Figure 2 shows the amino acid sequence
encompassing residues 149-197 of the G proteins of variants
of different subtypes of RSV. Sequences 1-15 are human RSV
strains: sequence 1 is that of the A2 strain of the
A subtype (Satake et al, 1985; Wertz et al, 1985),
sequence 2 is the Long A strain of the A subtype
(Johnson et al, 1987), and sequences 3-8 are natural
variants of the A subtype isolated in the same locality in
a single year (Cane et al, 1991). Sequences 9-15 are
natural variants of the B subtype isolated in different
localities over a 29-year period (Johnson et al, 1987;
Cane et al, 1991; Sullender et al, 1990; Sullender et al,
1991). Sequence 16 is that of bovine RSV (Lerch et al,
1990). Sequences 17 and 18 are variants of human RSV,
R10c/1 and R10c/10, which were generated by propagation of
the Long A strain in the presence of a monoclonal antibody
directed to the cysteine-containing constant region of the
ectodomain of the G protein (Rueda et al, 1994);

Figure 3 illustrates the HPLC separation of
protease fragments of RSV strain A2 G protein produced by
tryptic digestion of the entire protein (A) and by
digestion of the fraction eluting at approximately
73 minutes during HPLC of the tryptic digest with different
proteases (B-D). Chromatograms B, C and D are of digests
obtained using pepsin, thermolysin, and post-proline
cleavage enzyme, respectively;

Figure 4 shows MALDI-TOF-MS spectra of components
of the fraction eluting at approximately 73 minutes during
HPLC of the tryptic digest of Figure 3A. A spectrum of an
aliquot of the tryptic fraction is shown in A, and a
spectrum of an aliquot of the unfractionated peptic digest




of this tryptic fraction is shown in B. The unfractionated
peptic digestion was performed as for Figure 3, except that
the temperature was 22°C. C and D represent spectra of
fractions eluting at approximately 61 minutes and
64 minutes during HPLC of the peptic digest (Figure 3B).
Spectrum B was recorded in the linear mode. Spectra A and
B were recorded with matrix 3 and spectra C and D were
recorded with matrix 4;

Figure 5 shows the proposed identities of peptide 
fragments detected by MALDI-TOF-MS in various digests and
HPLC fractions. Theoretical m/z values corresponding to 
the proposed fragment identities are presented next to the 
corresponding sequence. All m/z values are for the 
oxidized sequences, except for fragments 1R, 2R and 3R, 
which are for reduced forms of these sequences; 

Figure 6 illustrates MALDI-TOF-MS spectra 
obtained by reduction of fragments 3(A) and 2(B) of the 
peptic digest of tryptic fragment 1. Matrix 4 was used to 
record both spectra; 

Figure 7 shows MALDI-TOF-MS spectra of an
unfractionated thermolytic digest of tryptic fragment 1 (A)
and the thermolytic fragment eluting at approximately
54 minutes during HPLC of the thermolytic digest of
Figure 3C (B). Conditions for thermolytic digestion were
as described for Figure 3 except that 0.5 mM CaCl2 was
used. Matrix 4 was used to record both spectra;

Figure 8 shows MALDI-TOF-MS spectra of an
unfractionated post-proline cleavage enzyme digest of
tryptic digest 1 (A), the fraction eluting at
approximately 63 minutes during HPLC (Figure 3D) of this
digest (B) and an unfractionated post-proline cleavage
enzyme digest of peptic fragment 2 (C). The unfractionated
digests were prepared as for Figure 3, except that final
enzyme concentrations of 0.1 mg/ml and 10 µg/ml were used
to obtain spectra A and C, respectively. Matrix 4 was used
to record all spectra;




Figure 9 shows post-source decay fragment ion
spectra of peptic fragment 2 (A), peptic fragment 3 (B) and
post-proline cleavage enzyme fragment 5 (C). Fragment ions
1-7, inclusive, correspond to an N-terminal fragment ion
series of the b-type resulting in fragmentation of the
amino acid residues with or without the peptide AICK.
Fragment ions 8-14, inclusive, correspond to a C-terminal
fragment ion series of the y-type. This fragmentation
pattern is presented in a diagrammatic form in Figure 10
together with m/z values for the numbered fragment ions of
peptic fragment 2. An expanded portion of spectrum A
(m/z = approximately 850 to 1650) is presented above the
full spectrum. Matrix 3 was used to record spectra A and B
and matrix 1 was used to record spectrum C; and

Figure 10 illustrates the proposed fragmentation 
pattern of peptic fragment 2 based on data from Figure 9A. 
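
For readers unfamiliar with the b-type and y-type ion series mentioned for Figures 9 and 10, the sketch below shows how singly charged b and y ion m/z values are conventionally computed for a simple linear peptide. It is an illustration under stated assumptions only: it uses standard monoisotopic residue masses, ignores disulphide links entirely, and takes the short peptide AICK as a toy input. It does not reproduce the fragment assignments of Figure 10, and the function names are hypothetical.

```python
# Standard monoisotopic residue masses (Da).
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
PROTON = 1.007276
WATER = 18.010565


def b_ions(peptide: str) -> list:
    """m/z of singly charged b1..b(n-1): N-terminal residue sums plus a proton."""
    total = 0.0
    ions = []
    for aa in peptide[:-1]:
        total += MONO[aa]
        ions.append(total + PROTON)
    return ions


def y_ions(peptide: str) -> list:
    """m/z of singly charged y1..y(n-1): C-terminal sums plus water and a proton."""
    total = WATER
    ions = []
    for aa in reversed(peptide[1:]):
        total += MONO[aa]
        ions.append(total + PROTON)
    return ions


if __name__ == "__main__":
    print([round(m, 3) for m in b_ions("AICK")])
    print([round(m, 3) for m in y_ions("AICK")])
```

A useful consistency check on any assignment is that a b ion and its complementary y ion sum to the neutral peptide mass plus two protons.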

Detailed Description of the Invention 

Inspection of the deduced amino acid sequence of 
the G protein of many RSV isolates (Satake et al, 1985; 
Wertz et al, 1985; Johnson et al, 1987; Cane et al, 1991; 
Sullender et al, 1990; Sullender et al, 1991; Garcia et al, 
1994) reveals several structural domains, which are 
illustrated in Figure 1. The N-terminal region 
(residues 1-38) is located on the inner aspect of the viral 
envelope, and is relatively conserved, as is the 
transmembrane region (residues 39-66). However, the 
ectodomain (232 residues) has two regions of comparative 
variation bordering a central region (residues 149-197), 
which is conserved within subgroups and contains 4 closely
positioned cysteine residues which are conserved in all RSV 
sequences. As shown in Figure 2, this region also has a 
sequence of 13 amino acids, including 2 of the conserved 
cysteine residues, which is identical in all wild type 
isolates of RSV that infect humans. The variable regions 
contain potential sites for N-linked glycosylation of 


asparagine, and have an abundance of serine and threonine
residues which are potential sites for O-linked
oligosaccharides. Thus the ectodomain comprises 7
occurrences of the consensus tripeptide sequence
asparagine-Xaa-threonine/serine, the motif for
asparagine-linked glycosylation, although at three of these sites Xaa
is proline, which is a contraindication of such
glycosylation. A relative abundance of proline residues in
the regions high in serine and threonine suggests the
presence of O-linked oligosaccharides. Some functional
studies indicate an active role for the O- and N-linked
oligosaccharides of the RSV membrane proteins in cellular
invasion (Lambert, 1988), but systematic analyses of the
structures and positioning of the oligosaccharides remain
to be performed.
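
The asparagine-Xaa-threonine/serine motif described above can be located by a simple scan. The sketch below is an illustration only: it is applied to residues 149-197 (SEQ ID NO 1) because the full ectodomain sequence is not reproduced in this text, and the function name `find_sequons` is an assumption introduced for the example.

```python
import re

# Residues 149-197 of the RSV strain A2 G protein (SEQ ID NO 1 in this text).
REGION = "KQRQNKPPSKPNNDFHFEVFNFVPCSICSNNPTCWAICKRIPNKKPGKK"
REGION_START = 149

# A lookahead is used so that overlapping motifs would all be reported.
SEQUON = re.compile(r"(?=N([A-Z])[ST])")


def find_sequons(sequence: str, start: int = 1) -> list:
    """Return (residue number of Asn, tripeptide, xaa_is_not_proline) tuples."""
    hits = []
    for match in SEQUON.finditer(sequence):
        i = match.start()
        xaa = match.group(1)
        hits.append((start + i, sequence[i:i + 3], xaa != "P"))
    return hits


if __name__ == "__main__":
    for position, tripeptide, favourable in find_sequons(REGION, REGION_START):
        print(position, tripeptide, "Xaa is not Pro" if favourable else "Xaa is Pro")
```

Within this window the only match is Asn-Pro-Thr at residue 179, one of the sites where Xaa is proline.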

Experimental Procedures 
Peptide Isolation 

Strain A2 RSV G protein was isolated by
immunoaffinity chromatography (Walsh, 1984), with
modifications to include immunoaffinity columns specific
for RSV F and nucleocapsid proteins prior to a final
G protein antibody column and elution from the affinity
column with potassium thiocyanate. Lyophilized G protein
samples were reconstituted in sufficient 0.1 M NH4HCO3 to
result in a final concentration of 0.01% Triton X-100
(v/v), 0.14 M NaCl and 10 mM phosphate buffer (pH 7.2 in
the absence of NH4HCO3). N-ethylmaleimide was also added
to a final concentration of 1 mM. Digestion of the intact
protein was performed for 4 hours at 37°C using two
additions of 1% (w/w) of sequencing grade trypsin
(Boehringer-Mannheim) with the second addition made at
2 hours. Subdigestion of peptide fragments isolated by
high performance liquid chromatography (HPLC) was achieved
with pepsin (Boehringer-Mannheim), thermolysin (Calbiochem)
and/or post-proline cleavage enzyme (Seikagaku Corporation,




Tokyo). Details of individual subdigestion protocols are
described below in association with specific experiments.

Proteolytic fragments were isolated by reverse
phase HPLC (RP-HPLC) using slight variations of a
previously described protocol (Gorman et al, 1990), using a
2.1 mm x 25 cm column of octadecasilica (Vydac), a flow
rate of 150 µl/min and a linear gradient from 0.1% (v/v)
aqueous trifluoroacetic acid to 80% (v/v) aqueous CH3CN
containing 0.09% (v/v) trifluoroacetic acid, developed over
90 minutes. Gradients were generated using a Hewlett
Packard chromatography system comprising a 1090M solvent
delivery system under the control of a DOS Chemstation, and
elution of peptides was monitored at 214 nm using a 1090
diode array detector.
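
As a rough guide to what an elution time means under the gradient described above, the nominal CH3CN content of the mobile phase can be interpolated linearly. This is a sketch under simplifying assumptions that are not stated in the source: the gradient is taken to start at injection, system dwell volume is ignored, and the composition is held constant after 90 minutes. The function name is hypothetical.

```python
def acetonitrile_percent(t_min: float,
                         gradient_min: float = 90.0,
                         final_percent: float = 80.0) -> float:
    """Nominal % (v/v) CH3CN at time t_min for a linear 0 to final_percent gradient."""
    if t_min < 0:
        raise ValueError("time must be non-negative")
    fraction = min(t_min / gradient_min, 1.0)  # hold at the final value afterwards
    return final_percent * fraction


if __name__ == "__main__":
    for t in (10.5, 25.0, 73.0):
        print(t, round(acetonitrile_percent(t), 1))
```

On these assumptions a peak at approximately 73 minutes corresponds to a nominal composition of roughly 65% CH3CN; the true value at the column would lag slightly because of dwell volume.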

Mass Spectrometry

Matrix-assisted laser desorption/ionisation time
of flight mass spectrometry (MALDI-TOF-MS) was performed
using a Bruker Reflex mass spectrometer (Bruker-Franzen
Analytik GmbH, Bremen, Germany). Samples were deposited on
to target surfaces after mixing with an equal volume of the
supernatant fraction of a saturated mixture of
α-cyano-4-hydroxy-cinnamic acid (matrix 1) or
3,5-dimethoxy-4-hydroxy-cinnamic acid (matrix 2) in 33% (v/v) aqueous CH3CN
containing 0.1% (v/v) trifluoroacetic acid (Beavis et al,
1992), or a 10 mg/ml solution of α-cyano-4-hydroxy-cinnamic
acid in 50% (v/v) C2H5OH/CH3CN (matrix 3), or a 10 mg/ml
solution of 2,6-dihydroxyacetophenone in 50% (v/v)
C2H5OH/CH3CN containing 0.1 M di-ammonium hydrogen citrate
(matrix 4) (Gorman et al, 1996). Ionization was achieved
using a nitrogen laser pulsed at a repetition rate of 3 Hz.
Laser irradiance was adjusted to threshold levels in order
to observe intact molecular ions, which were accelerated to
a potential of 28.5 kV on to a hybrid microchannel
plate-photomultiplier linear detector or subsequently reflected
with a reflectron potential of 30 kV on to a dual




microchannel plate detector. Except where specifically
noted, molecular ion masses were determined in the
reflectron mode. Matrix ions were deflected by application
of a 2 kV potential across deflector plates placed
immediately after the ion acceleration region in order to
avoid saturation of the detectors. All measurements were
made at a digitization rate of 250 MHz. Masses were
assigned to intact peptide ions by reference to an external
calibration file created using the flight times of the
components of a mixture of 200 fmoles of angiotensin II
(MH+ = 1046.56; monoisotopic mass), 400 fmoles of
adrenocorticotropic hormone residues 18-39 (MH+ = 2466.73;
average mass) and 2 pmoles of bovine insulin
(MH+ = 5734.54; average mass) applied to a separate target
spot.
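
External calibration of a time-of-flight instrument is commonly based on the approximately linear relation between flight time and the square root of m/z. The sketch below illustrates that idea with the three calibrant MH+ values quoted above. It is not the Bruker calibration routine: the flight times are synthetic, generated from made-up instrument constants (`k`, `t0`), and every function name is an assumption introduced for this example.

```python
import math

# Calibrant MH+ values quoted in the text.
CALIBRANT_MASSES = [1046.56, 2466.73, 5734.54]


def synthetic_time(mass: float, k: float = 0.85, t0: float = 0.30) -> float:
    """Hypothetical flight time (microseconds); k and t0 are invented constants."""
    return k * math.sqrt(mass) + t0


def fit_calibration(times, masses):
    """Least-squares fit of sqrt(m/z) = slope * t + intercept."""
    roots = [math.sqrt(m) for m in masses]
    n = len(times)
    mean_t = sum(times) / n
    mean_r = sum(roots) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    sxy = sum((t - mean_t) * (r - mean_r) for t, r in zip(times, roots))
    slope = sxy / sxx
    intercept = mean_r - slope * mean_t
    return slope, intercept


def assign_mass(time: float, slope: float, intercept: float) -> float:
    """Convert a flight time to m/z using the fitted calibration."""
    return (slope * time + intercept) ** 2


if __name__ == "__main__":
    times = [synthetic_time(m) for m in CALIBRANT_MASSES]
    slope, intercept = fit_calibration(times, CALIBRANT_MASSES)
    # Assign a mass to a hypothetical analyte flight time.
    print(round(assign_mass(synthetic_time(4125.3), slope, intercept), 2))
```

With real data the calibrant flight times would come from the calibration spot, and the residuals of the fit give a first indication of the mass accuracy to expect for analyte ions.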

Analysis of metastable ions arising from post-source
decay was performed using 25% stepwise decrements in
the reflectron potential and increasing the laser
irradiance to optimise production of ions in each voltage
window (Spengler et al, 1992; Kaufmann et al, 1993;
Kaufmann et al, 1994). Masses were assigned to metastable
ions by reference to a calibration table created by
determining the behaviour of metastable ions of known mass,
produced from adrenocorticotropic hormone residues 18-39,
at various reflectron potentials (Rouse et al, 1995). As
described above, data were acquired at a digitization rate
of 250 MHz. Assembly of the individual spectra for each
reflectron voltage on to a continuous mass scale was
performed using Bruker FAST software routines within the
Bruker XMASS software package.

Reduction of Disulphide Bonds

Disulphide containing peptides were reduced,
after adjusting HPLC fractions to pH 5 by the addition of
1 M aqueous di-ammonium hydrogen citrate to a final
concentration of 0.1 M, by addition of 50 mM aqueous




tris(2-carboxyethyl)-phosphine (Molecular Probes) to a
final concentration of 5 mM and incubating the mixtures at
65°C for 20 minutes. Reduced samples were mixed with an
equal volume of 2,6-dihydroxyacetophenone in 50% (v/v)
C2H5OH/CH3CN and applied to a sample target for mass
spectrometric analysis.

Edman Degradation

Stepwise amino acid sequence analysis of peptides
was performed by automated Edman degradation using a
Hewlett Packard G1000A solid-phase protein sequenator.

Example 1 Disulphide Determination by Analysis of
Proteolytic Fragments

The whole G protein was digested with trypsin,
and a fraction of the tryptic digest was then subjected to
further digestion with pepsin, thermolysin, or post-proline
cleavage enzyme as described. The digests were analysed by
RP-HPLC, and the results are summarized in Figure 3.
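
The tryptic step can be mimicked in silico with the usual simplified rule that trypsin cleaves after lysine or arginine unless the next residue is proline. The sketch below applies that rule to residues 149-197 (SEQ ID NO 1) only, since the full G protein sequence is not reproduced here; the fragments at either edge of the window are therefore artefacts of the window boundaries. The function name is an illustrative assumption.

```python
# Residues 149-197 of the RSV strain A2 G protein (SEQ ID NO 1 in this text).
REGION = "KQRQNKPPSKPNNDFHFEVFNFVPCSICSNNPTCWAICKRIPNKKPGKK"
REGION_START = 149


def tryptic_peptides(sequence: str, start: int = 1) -> list:
    """Cleave after K or R unless followed by P.

    Returns (first residue number, last residue number, peptide) tuples.
    """
    peptides = []
    begin = 0
    for i, aa in enumerate(sequence):
        is_last = i == len(sequence) - 1
        if is_last or (aa in "KR" and sequence[i + 1] != "P"):
            peptides.append((start + begin, start + i, sequence[begin:i + 1]))
            begin = i + 1
    return peptides


if __name__ == "__main__":
    for first, last, peptide in tryptic_peptides(REGION, REGION_START):
        print(first, last, peptide)
```

Under this rule the lysines at positions 154 and 158 are not cleaved because each is followed by proline, so the four cysteines stay together on a single peptide spanning residues 152 to 187.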

Chromatogram A resulted from injection of 100 µl
of a digest which contained approximately 40 µg of
G protein exposed to trypsin for 4 hours at 37°C (see
experimental procedures for detailed conditions).
Subdigestions of the fraction eluting at 73 minutes in
chromatogram A were achieved after removal of the CH3CN
using a stream of high purity nitrogen. Pepsin (10 µl of a
1 mg/ml solution in 5% (v/v) formic acid) was added to
100 µl of the fraction and digestion was allowed to proceed
for 2 h at 37°C prior to injecting 100 µl of the digest to
generate chromatogram B. The fraction was prepared for
thermolytic digestion by mixing 45 µl of the fraction with
45 µl of 0.1 M NH4HCO3 and 11 µl of 0.01 M CaCl2.
Thermolysin (10 µl of a 1 mg/ml solution in 0.1 M NH4HCO3)
was added and digestion allowed to proceed for 2 hours at
37°C prior to injecting 100 µl to generate chromatogram C.
The fraction was prepared for post-proline cleavage enzyme


digestion by mixing 45 µl of the fraction with 20 µl of
0.1 M NH4HCO3 before adding 45 µl of 0.1 mg/ml solution of
the enzyme in 0.1 M ammonium acetate to give
a final pH of 6.5. Digestion was allowed to proceed for
2 hours at 37°C prior to injecting 100 µl of the digest to
generate chromatogram D.
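
The final concentrations implied by the mixing volumes given for the subdigestions follow from simple dilution arithmetic. The sketch below assumes that volumes are additive and that the stated volumes are the only components of each mixture; the function name is hypothetical.

```python
def final_concentration(stock_conc: float, stock_volume: float,
                        other_volumes: list) -> float:
    """Concentration after mixing, in the same units as stock_conc.

    Volumes may be in any unit as long as they are all the same.
    """
    total = stock_volume + sum(other_volumes)
    return stock_conc * stock_volume / total


if __name__ == "__main__":
    # Pepsin: 10 ul of 1 mg/ml added to 100 ul of fraction.
    print(round(final_concentration(1.0, 10, [100]), 3), "mg/ml pepsin")
    # CaCl2: 11 ul of 10 mM with 45 ul fraction, 45 ul buffer, 10 ul enzyme.
    print(round(final_concentration(10.0, 11, [45, 45, 10]), 2), "mM CaCl2")
    # Post-proline cleavage enzyme: 45 ul of 0.1 mg/ml with 45 ul + 20 ul.
    print(round(final_concentration(0.1, 45, [45, 20]), 3), "mg/ml enzyme")
```

On these assumptions the thermolytic digest contains close to 1 mM CaCl2, which is consistent with the 0.5 mM CaCl2 condition for Figure 7 being described as an exception.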

The following fragments were produced by
enzymatic digestion:

Fragment 1        Trypsin cleavage of G protein;
Fragments 2 & 3   Pepsin cleavage of Fragment 1;
Fragment 4        Thermolysin cleavage of Fragment 1;
Fragment 5        Cleavage of Fragment 1 with post-proline protease;
Fragment 6        Cleavage of Fragment 2 with post-proline protease;
Fragment 2R       Reduced Fragment 2; and
Fragment 3R       Reduced Fragment 3.

Cleavage with Trypsin

RP-HPLC of tryptic digests of the G protein
consistently revealed only a few discrete peaks of
absorbance at 214 nm at the column breakthrough and at
approximately 10.5, 25 and 73 minutes (Figure 3A). The
remainder of the chromatogram consisted of several series
of broad baseline rises consisting of poorly resolved peaks
of absorbance at 214 nm. This appearance is consistent
with extensive attachment of heterogeneous oligosaccharides
to most of the peptide constituents of the digest. The
peak eluting at 10.5 minutes represented N-ethylmaleimide,
which was added to the digest as a precaution against
disulphide bond interchange or disulphide bond oxidation of
cysteine-containing fragments. The peak eluting at
10.5 minutes was not present in a digest from which
N-ethylmaleimide was omitted.

The other early-eluting discrete peaks failed to
produce data when examined by MALDI-TOF-MS; however, as
shown in Figure 4A, the peak at 73 minutes produced intense
ion signals at m/z values of 4108.0 and 4125.3. These
masses are consistent with the tryptic peptide spanning
residues 152 to 187 of the G protein (Fragment 1), taking
into account oxidation of the four cysteines of this
sequence to cystine residues, which leads to a loss of 4 Da
in mass from both peptides. Partial conversion of the
N-terminal glutamine residue 152 to pyroglutamic acid
apparently accounts for the difference of 17 Da in mass
between these ions. These data also indicate that
Fragment 1 does not have any carbohydrate attached to the
asparagine residue at position 179, the threonine residue
at position 181, or the serine residues at positions 157,
174 and 177.
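
The mass argument above can be checked with a short calculation. The following is a minimal sketch, not the procedure used by the applicant: it assumes standard average residue masses, singly protonated ions, a loss of two hydrogens per disulphide and a loss of NH3 for conversion of N-terminal glutamine to pyroglutamic acid. The Fragment 1 sequence is the one given in Figure 5; the function name `mh_plus` is illustrative only.

```python
# Standard average residue masses (Da) for the amino acids present in Fragment 1.
RESIDUE_MASS = {
    "A": 71.0788, "C": 103.1388, "D": 115.0886, "E": 129.1155,
    "F": 147.1766, "H": 137.1411, "I": 113.1594, "K": 128.1741,
    "N": 114.1038, "P": 97.1167, "Q": 128.1307, "S": 87.0782,
    "T": 101.1051, "V": 99.1326, "W": 186.2132,
}
WATER = 18.0153      # H2O added for the free N- and C-termini
HYDROGEN = 1.00794   # average mass of one hydrogen atom
PROTON = 1.00728     # charge carrier for the [M+H]+ ion
AMMONIA = 17.0305    # NH3 lost when N-terminal Gln cyclises to pyroglutamic acid

# Tryptic Fragment 1 (residues 152 to 187), sequence as given in Figure 5.
FRAGMENT_1 = "QNKPPSKPNNDFHFEVFNFVPCSICSNNPTCWAICK"


def mh_plus(sequence, disulphides=0, pyroglu=False):
    """Average [M+H]+ of a peptide with the given number of disulphide bonds."""
    mass = sum(RESIDUE_MASS[aa] for aa in sequence) + WATER
    mass -= 2 * HYDROGEN * disulphides   # each disulphide removes two hydrogens
    if pyroglu:
        mass -= AMMONIA
    return mass + PROTON


oxidised = mh_plus(FRAGMENT_1, disulphides=2)
oxidised_pyroglu = mh_plus(FRAGMENT_1, disulphides=2, pyroglu=True)
reduced = mh_plus(FRAGMENT_1, disulphides=0)

print(f"two disulphides:              {oxidised:.1f}")
print(f"two disulphides, pyroglutamyl: {oxidised_pyroglu:.1f}")
print(f"loss on forming 2 disulphides: {reduced - oxidised:.2f} Da")
```

Under these assumptions the calculated values fall close to the observed ions at m/z 4125.3 and 4108.0, and the 17 Da spacing follows directly from the loss of ammonia.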

Subcleavage with Pepsin 

Cleavage of isolated Fragment 1 with pepsin and
analysis of the unfractionated digest by MALDI-TOF-MS
produced ions consistent with residues 152 to 165 and
residues 169 to 187 (Figure 4B). Ions at m/z = 1653.2 and
m/z = 1670.0, representing residues 152 to 165, were spaced
by 17 Da, indicating a mixture of glutaminyl and
pyroglutamyl residues derived from the N-terminus of the
original tryptic Fragment 1. Ions at 2096.7 and 2115.7
represented the C-terminal portion of Fragment 1
(residues 169 to 187), allowing for peptide bond cleavage
within a cystine loop of the heavier peptide (Fragments 2
and 3). As for the parent tryptic peptide, 4 Da had to be
subtracted from the theoretical masses of the C-terminal
portions to obtain correspondence with the experimentally
observed masses. This provided further evidence that the
four cysteine residues of the sequence were in disulphide
linkage.



Fragments 2 and 3 were isolated by RP-HPLC
(Figure 3B) and analysed as isolated fragments, which
provided corroboration of the identities assigned to ions in
the unfractionated digest, especially the spacing of 18 Da
between the ions representing the C-terminus of tryptic
Fragment 1, which indicated intra-disulphide loop peptide
bond cleavage for the heavier ion. Peaks at 52 and 52.5 minutes
corresponded to N-terminal peptic fragments, the peak at
61 minutes corresponded to the heavier of the C-terminal
peptic fragments (Fragment 2; Figure 4C), and the peak at
64 minutes corresponded to the lighter of the C-terminal
fragments (Fragment 3; Figure 4D). Mass analysis of these
peptides after chemical reduction showed an increase in
mass of 4 Da for the lighter of the C-terminal fragments,
consistent with acquisition of four protons as a
consequence of reduction of two disulphide bonds
(Figure 6A). The heavier of the unreduced peptides
actually lost mass, corresponding to loss of the sequence
Ala-Ile-Cys-Lys and gain of 3 protons following reduction
of two disulphide bonds (Figure 6B). These data indicate
that the peptic cleavage within a disulphide loop to
produce the heavier of the C-terminal peptic fragments
(Fragment 2) occurred after the sole tryptophan of the
parent tryptic Fragment 1 (tryptophan 183 of the
G protein).
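
The mass shifts described for the peptic fragments can be reproduced by simple bookkeeping. The sketch below is illustrative only and assumes average masses: it starts from the theoretical [M+H]+ of Fragment 3 listed in Figure 5 (2097.47), adds one water for the hydrolysed Trp183-Ala184 bond to obtain Fragment 2, and adds four hydrogens for reduction of two disulphides. For Fragment 2 the reduced Ala-Ile-Cys-Lys chain, which carries one of those four hydrogens, is then subtracted as a free peptide. Variable names are hypothetical.

```python
HYDROGEN = 1.00794   # average mass of one hydrogen atom (Da)
WATER = 18.0153      # average mass of H2O (Da)

# Standard average residue masses (Da) of the Ala-Ile-Cys-Lys chain.
AICK_RESIDUES = {"A": 71.0788, "I": 113.1594, "C": 103.1388, "K": 128.1741}

# Theoretical [M+H]+ of Fragment 3 (two disulphides, intact backbone), from Figure 5.
FRAGMENT_3_MH = 2097.47

# Fragment 2: same peptide with one peptide bond hydrolysed inside the outer loop.
fragment_2_mh = FRAGMENT_3_MH + WATER

# Fragment 3R: reduction of two disulphides adds four hydrogens; the chain stays intact.
fragment_3r_mh = FRAGMENT_3_MH + 4 * HYDROGEN

# Fragment 2R: reduction releases the AICK chain, which was held only by its disulphide.
# The free, reduced AICK peptide is its residues plus one water.
aick_free = sum(AICK_RESIDUES.values()) + WATER
fragment_2r_mh = fragment_2_mh + 4 * HYDROGEN - aick_free

print(f"Fragment 2:  {fragment_2_mh:.2f}")
print(f"Fragment 3R: {fragment_3r_mh:.2f}")
print(f"Fragment 2R: {fragment_2r_mh:.2f}")
```

The three derived values agree to within rounding with the theoretical masses listed for Fragments 2, 3R and 2R in Figure 5, which is the pattern expected if the peptic cut lies between Trp183 and Ala184 inside the Cys173 to Cys186 loop.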

Subcleavage with Thermolysin

Cleavage of the tryptic Fragment 1 with
thermolysin and direct analysis of the digest reaction by
MALDI-TOF-MS produced ions consistent with cleavages prior
to phenylalanine residue 19 of Fragment 1 and prior to
isoleucines at positions 24 and 34 of Fragment 1. Ions
diagnostic of these cleavages were at m/z values of 912.9
and 1106.9 (Figure 7A). The ion at m/z = 912.9 is
consistent with disulphide bridging between cysteines 173 and 186
(residues 19-23 of Fragment 1 linked to residues 34-36),
and the ion at m/z = 1106.9 is consistent with disulphide
bridging between cysteines 176 and 182, but without
cleavage of intra-cystine-loop peptide bonds. RP-HPLC of
the thermolytic digest revealed a complex series of peaks,
as shown in Figure 3C. Most of these peaks appeared to
have been derived from the enzyme preparation, as they were
present in a chromatogram produced using a mock digest
which lacked Fragment 1. Some of these peaks produced ions
which did not correspond to any portion of Fragment 1. The
digest had one peak at approximately 38.5 minutes which
yielded an ion at m/z = 912.2, consistent with the
cysteine 173 to 186 loop. However, this fraction was a
mixture of peptides, as evidenced by other ions at higher
m/z values, and was not characterized any further. The
fraction eluting at approximately 54 minutes was not
present in the mock digest, and yielded the correct mass
for the peptide containing the cysteine 176 to 182 loop
(Fragment 4), as shown in Figure 7B. Automated Edman
degradation sequencing produced an N-terminal sequence of
Ile-Xaa-Ser-Asn, which is consistent with the
identification of this fragment by mass analysis, with Xaa
representing a gap corresponding to retention of the PTH
derivative of one half cystine residue on the sequencer
cartridge due to disulphide linkage to its half cystine
pair.
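
Because this section switches between positions within Fragment 1 and residue numbers of the G protein, a small lookup helps to keep the two numbering schemes aligned. This is a minimal sketch assuming that Fragment 1 has the sequence shown in Figure 5 and begins at residue 152, as stated above; the helper names are illustrative.

```python
# Tryptic Fragment 1, sequence as given in Figure 5.
FRAGMENT_1 = "QNKPPSKPNNDFHFEVFNFVPCSICSNNPTCWAICK"
FIRST_RESIDUE = 152  # G protein number of the first residue of Fragment 1


def to_protein_position(fragment_position):
    """Convert a 1-based position in Fragment 1 to G protein numbering."""
    return FIRST_RESIDUE + fragment_position - 1


def residue_at(fragment_position):
    """One-letter code of the residue at a 1-based position in Fragment 1."""
    return FRAGMENT_1[fragment_position - 1]


# G protein positions of the four cysteines.
cysteines = [
    to_protein_position(i)
    for i, aa in enumerate(FRAGMENT_1, start=1)
    if aa == "C"
]
print(cysteines)

# Thermolysin cleavage sites named in the text: before residues 19, 24 and 34.
for pos in (19, 24, 34):
    print(pos, residue_at(pos), to_protein_position(pos))

# The two chains of the m/z 912.9 ion: residues 19-23 and 34-36 of Fragment 1.
print(FRAGMENT_1[18:23], FRAGMENT_1[33:36])
```

Position 19 is a phenylalanine and positions 24 and 34 are isoleucines, and the two chains of the smaller ion each contain a single cysteine (Cys173 and Cys186 respectively), so that ion can only arise from a bridge between those two residues.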



Subcleavage with Post-Proline Cleavage Enzyme 

Cleavage of tryptic Fragment 1 with a post-proline
cleavage enzyme produced a fragment (Fragment 5)
containing the two disulphides, but without
intra-disulphide loop cleavage at the proline residue in the
cystine 176 to 182 loop. Ions with m/z values indicative
of this fragment (m/z = 1639.7) were observed in both the
digestion reaction mixture (Figure 8A) and in a fraction at
approximately 63 minutes generated by HPLC of the reaction
mixture (Figure 3D). This is shown in Figure 8B. Post-proline
cleavage of the heavier of the C-terminal peptic
fragments of Fragment 1 (i.e. Fragment 2) also achieved
cleavage at the extra-cysteine loop proline but not the
intra-loop proline (Fragment 6), as indicated by appearance
of an ion at m/z = 1658.8 in the digest illustrated in
Figure 8C. We attempted to obtain further fragments
consistent with the disulphide bond pattern indicated
above, using cleavage strategies involving chymotrypsin,
proteinase K and mild acid; however, these strategies
either failed to contribute additional data or were not
rewarding at all.

The residues involved in each fragment, and their
observed and theoretical m/z ratios, are summarized in
Table 1, and the structure of each fragment, indicating its
disulphide bonds, is shown in Figure 5. It can be seen
that the pattern of disulphide bonding is the same in each
fragment, and that this pattern is destroyed by reduction.



[Table 1: residues and observed and theoretical m/z values of each fragment; illegible in this copy.]

Example 2    Disulphide Determination by Mass
             Spectrometric Based Sequence Analysis
             of Fragments 2, 3 and 5

Analysis of products of gas-phase metastable
decomposition of ions of peptic Fragment 2 by MALDI-TOF-MS
produced a series of fragment ions illustrated in
Figure 9A, which were consistent with the disulphide
bonding pattern deduced by analysis of proteolytic
fragments (Figure 5). The series of fragment ions obtained
is represented both diagrammatically and in tabular form in
Figure 10. These fragment ions are of the b- and y-type
(Roepstorff and Fohlman, 1984) representing cleavages at
the peptide bonds along the peptide backbone. Fragment
ions 1 to 4 inclusive are a series of b-type ions which are
independent of possible disulphide bonding arrangements,
but fragment ions 5 to 7 inclusive are diagnostic of the
indicated disulphide linkage, due to the concomitant loss
of mass of the AICK sequence together with
fragmentations at Cys173 or Ser174 or Ile175 of the larger
peptide chain. Furthermore, this diagnosis is supported by
the occurrence of fragment ions 8 to 11, which also bear
the mass of the AICK sequence, and by the failure to
observe this mass accompanying fragment ions 12 to 14.

A comparable analysis of peptic Fragment 3
revealed the sequence specific b-type fragment ions 2, 3
and 4 seen for peptic Fragment 2, which are independent of
the disulphide bonding pattern, as shown in
Figure 9B. Fragment ions of Fragment 3 at m/z values of
1983.3, 1836.7 and 1735.6 are potentially equivalent to
y-type ions 8, 9 and 10 respectively of Fragment 2 when a
mass difference due to inclusion of an additional peptide
bond in Fragment 3 is taken into account. Other prominent
fragment ions of Fragment 3 were apparent at m/z values of
288.4, 648.9 and 719.1. Only the ions at m/z values of
648.9 and 719.1 were also seen with Fragment 2. The ion at
m/z = 648.9 can be rationalised as a b-type ion resulting
from cleavage between Ser174 and Ile175 of the peptide,
plus cleavage of the disulphide bond involving Cys173. The
ions at m/z values of 288.4 and 719.1 cannot be easily
accounted for. None of the fragment ions of Fragment 2
which were used to support the disulphide bonding
arrangement were observed with Fragment 3. This supports
the logic used in interpretation of the ion series of
Fragment 2 used to define the disulphide pattern. This
logic was dependent upon identifying ions as being
combinations of metastable ion masses due to cleavage along
the peptide backbone at the N-terminus of Fragment 2
plus mass contributed by the disulphide-linked
AICK sequence. The AICK sequence was also linked by
a peptide bond to Trp183 in Fragment 3, which explains why
it was not liberated together with metastable fragment ions
from this peptic fragment.

As shown in Figure 9C, post-proline cleavage
Fragment 5 failed to produce metastable fragment ions of
comparable intensity to those produced by Fragments 2 and
3. These observations with Fragment 5 support the
conclusions drawn from the data on Fragments 2 and 3.
Fragment 5 does not have the N-terminal NFVP sequence,
or the cleavage following Trp183 required to produce
fragment ions equivalent to fragment ions 1 to 4 and 8 to
11 seen with Fragments 2 and 3.

As shown in Figure 10 and Table 2, the results of
analysis of metastable ions produced by peptic Fragments 2
and 3 and post-proline cleavage Fragment 5 (Figure 9) were
consistent with the pattern deduced by analysis of
proteolytic fragments as shown in Figure 5.

Values in Table 2 for the observed fragment ions
are an average of three separate determinations with peptic
Fragment 2. Fragment ions 5-7 (b-type ions), inclusive,
and 8-11 (y-type ions), inclusive, are diagnostic of the
proposed disulphide pattern, because they account for the
mass of the linked peptide, AICK, in addition to the mass
produced by the cleavage of amino acids as indicated.



[Table 2: observed fragment ions of peptic Fragment 2; illegible in this copy.]

Example 3    Synthesis of Disulphide-Bonded Peptide

A fully synthetic peptide of sequence
corresponding to that of Peptide 1 in Figure 2 (SEQ ID
NO: 1), i.e. amino acids 149 to 197 inclusive of the
G protein of RSV strain A2, was prepared using conventional
solid-phase methods.

This peptide preferentially formed the same
disulphide bonding pattern as that identified in the
previous examples.

Discussion

We have now shown that the disulphide bonding
pattern of the ectodomain of the unusual attachment protein
or G protein of RSV involves a preferred stable
configuration with Cys173 linked to Cys186, and Cys176
linked to Cys182. This was achieved by a combination of
analysis of proteolytic fragments of the protein and
further analysis of metastable ions produced from the
proteolytic fragments during MALDI-TOF-MS. These findings
represent a potent demonstration of the utility of
MALDI-TOF-MS for the mass analysis and structural
elucidation of peptides, with simultaneous characterization
of post-translational modifications. The observation that
the synthetic peptide corresponding to residues 149 to 197
formed the same disulphide bond arrangement as the viral
protein strongly indicates that the preferred configuration
is actually the sole configuration.

It is apparent that the disulphide loop between
Cys176 and Cys182 forms a restricted region of
accessibility, since the proline residue in this loop was
not vulnerable to post-proline cleavage enzyme under
conditions where another proline residue preceding the
disulphide loops was cleaved. In contrast, the loop between
Cys173 and Cys186 appears to be more accessible, since
peptide bonds within this loop were cleaved by both pepsin
and thermolysin.

Our results are surprising in view of the
extremely high degree of post-translational glycosylation
of the RSV G protein. The characteristics of the
chromatogram obtained by HPLC of a tryptic digest of the
G protein indicated extensive glycosylation of the
ectodomain. However, it is evident that the region of the
G protein ectodomain which we have defined does not carry
any oligosaccharides. Furthermore, the 3 and 10 residues
attached to the N- and C-terminal ends, respectively, of
this region do not have the potential for glycosylation.
This represents 49 of the 232 residues of the ectodomain,
or approximately 20%, which are not glycosylated.
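
The residue count in this paragraph can be cross-checked against the numbering used elsewhere in the specification. The sketch below assumes that the non-glycosylated stretch corresponds to residues 149 to 197 (the synthetic peptide of Example 3) and that the defined region is tryptic Fragment 1 (residues 152 to 187). Neither mapping is stated in this paragraph itself, so both are assumptions. The ectodomain length of 232 is taken from the text.

```python
# Assumed mapping: the 49-residue stretch is residues 149-197 (Example 3 peptide)
# and the defined disulphide region is tryptic Fragment 1 (residues 152-187).
STRETCH = range(149, 198)      # 149 to 197 inclusive
FRAGMENT_1 = range(152, 188)   # 152 to 187 inclusive
ECTODOMAIN_LENGTH = 232        # residues in the ectodomain, as stated in the text

n_terminal_flank = FRAGMENT_1.start - STRETCH.start   # residues 149-151
c_terminal_flank = STRETCH.stop - FRAGMENT_1.stop     # residues 188-197
fraction = len(STRETCH) / ECTODOMAIN_LENGTH

print(len(STRETCH), n_terminal_flank, c_terminal_flank)
print(f"{fraction:.1%} of the ectodomain")
```

With these assumptions the flanks come out as 3 and 10 residues and the stretch as 49 residues, matching the figures quoted above, and the fraction is a little over one fifth of the ectodomain.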

Thus it is apparent that the ectodomain of the
G protein has a subdomain structure, in which two highly
glycosylated subdomains of 83 and 101 amino acid residues
are separated by a comparatively smaller non-glycosylated
subdomain which has a highly defined disulphide bond
arrangement. The occupancy status of the remaining
potential glycosylation sites and the characteristics of
the glycans are yet to be determined. Definition of the
glycosylation of the subdomains is essential in order to
assess the contribution of oligosaccharides to the
mechanism of action and immunobiology of the G protein.

The region of the G protein ectodomain containing
the disulphides and the peptide sequences immediately
adjacent to the N- and C-terminal ends of this region
appear to have functional significance for both receptor
interactions and immunological reactivity of the G protein.
A reassessment of earlier antigenic analyses in the light
of our results has indicated an important role for the
disulphides in maintaining the structural integrity of the
protein. Studies with nested sets of synthetic peptides
representing overlapping portions of the ectodomain have
demonstrated that rabbit polyclonal antibodies and murine
monoclonal antibodies to the G protein and human
convalescent sera from natural infection all react in
common with a peptide containing three of the four
cysteines of the ectodomain (Norrby et al, 1987). The
rabbit antisera also reacted with a variety of peptides,
but the convalescent sera only reacted with two other
peptides, one of which overlapped the commonly reactive
peptide and another closely positioned peptide, while the
monoclonals only reacted with the commonly reactive
cysteine-containing peptide. It is possible that a wider
spectrum of antibody reactivities with the G protein might
have been evident had the epitope scanning experiments
utilised glycosylated domains of the G protein.

Subsequent studies indicated that the
cysteine-containing region formed a subgroup-specific antigenic
determinant, and that intact disulphide bonds were
important for this characteristic (Akerlind-Stopner et al,
1990). Further support for the immunological importance of
this region came from studies with escape mutants generated
using a neutralising monoclonal antibody (Rueda et al,
1994). These escape mutants had mutations at either Cys182
or Cys186 of their G proteins (Figure 2). The overall
effect of these changes was apparently sufficient to enable
the mutant viruses to escape neutralisation but to retain
functionally effective G proteins, as indicated by their
ability to compete with wild-type virus in infectivity
assays. Thus, it is possible that the mutant viruses had
subtle, but immunochemically significant, differences in
the surface chemistry of the cysteine regions of the
ectodomains of their G proteins, while retaining a
functionally competent structural fold. From our results
it is apparent that these mutants retained the ability to
form one of the two correct disulphides of the ectodomain,
that is either the Cys173 to Cys186 or the Cys176 to Cys182
linkage. The replacement of the cysteine residues by
arginine residues in both cases presumably compensated for
the loss of stability associated with loss of a disulphide
bond by replacement with a residue with the ability to form
a salt bridge.

The functional importance of the disulphide
region is shown by studies using various polypeptides
produced by recombinant routes or by chemical synthesis to
immunize experimental animals against challenge with live
RSV. A recombinant vaccinia virus expressing a polypeptide
bridging the disulphide region (residues 1-230 of the
G protein) has been shown to produce neutralising
antibodies and to confer protection from challenge with
live RSV, with a response equivalent to that elicited by a
recombinant vaccinia virus expressing full length
G protein. In contrast, a recombinant vaccinia virus
expressing a polypeptide terminating at residue 180
(residues 1-180) failed to provide protection (Olmstead et
al, 1989). Another recombinant vaccinia construct,
encompassing residues 124-203, also conferred protection
(Simard et al, 1995). The importance of the disulphide region
is also illustrated by the finding that a synthetic peptide
containing three of the cysteine residues
(residues 174-187) conferred protection from
challenge by live RSV, despite the fact that only
non-neutralising serum antibodies were produced (Trudel et al,
1991).

Nested sets of synthetic peptides have also been
used to attempt to define the portion of the G protein
which interacts with the cellular receptor for RSV (Feldman
and Hendry, 1996); however, none of these peptides from the
ectodomain blocked binding of the G protein to
RSV-susceptible cells. It was postulated that the peptides may
have lacked secondary or tertiary structural elements
required for interaction with receptors. However, as with
the immunological studies, the lack of oligosaccharides on
these peptides appears to have been overlooked by previous
workers. Furthermore, the disulphide bond status of these
peptides does not appear to have been addressed. It is
conceivable that the structural elements missing from the
synthetic peptides involve correct disulphide bond pairing
or another element contained by such disulphide bonds.

Knowledge of the actual disulphide linkage
pattern of the G protein ectodomain facilitates experimental
assessment of the functional and immunological roles of the
disulphides and neighbouring peptide sequences. Our
results have enabled us to develop strategies for synthesis
of peptides with the correct disulphide bridging to probe
receptor binding and immunological interactions of this
portion of the G protein. Furthermore, we have
demonstrated that synthetic strategies for this subdomain
of the G protein did not need to consider glycosylation.
Peptide by-products with the incorrect disulphide bonding
arrangements are theoretically possible from synthetic
approaches; however, these can easily be identified by
routine methods, and will serve as control peptides for
assessment of the relevance of the particular disulphide
pattern determined herein. In fact, no such peptide
by-products were observed. Peptide products with residual
blocking groups were obtained, and can also be used as
controls.
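
The number of theoretically possible by-products follows from counting the ways in which four cysteines can be paired into two disulphides. The following is a minimal sketch; the function name `pairings` is illustrative. It enumerates every complete pairing of the four cysteine positions and separates the pattern determined herein from the alternatives.

```python
def pairings(items):
    """Return every way of splitting an even-length list into unordered pairs."""
    if not items:
        return [[]]
    first, rest = items[0], items[1:]
    result = []
    for i, partner in enumerate(rest):
        remaining = rest[:i] + rest[i + 1:]
        for tail in pairings(remaining):
            result.append([(first, partner)] + tail)
    return result


CYSTEINES = [173, 176, 182, 186]
OBSERVED = [(173, 186), (176, 182)]   # pattern determined in this specification

all_patterns = pairings(CYSTEINES)
alternatives = [p for p in all_patterns if p != OBSERVED]

for pattern in all_patterns:
    label = "observed" if pattern == OBSERVED else "alternative"
    print(pattern, label)
```

Only three fully oxidised patterns exist, so at most two incorrectly bridged species (Cys173 to Cys176 with Cys182 to Cys186, and Cys173 to Cys182 with Cys176 to Cys186) would need to be distinguished from the correct product.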

It will be apparent to the person skilled in the
art that while the invention has been described in some
detail for the purposes of clarity and understanding,
various modifications and alterations to the embodiments
and methods described herein may be made without departing
from the scope of the inventive concept disclosed in this
specification.

References cited herein are listed on the
following pages, and are incorporated herein by this
reference.



REFERENCES

1. Akerlind-Stopner, B., Utter, G., Mufson, M.A., Orvell, C., Lerner, R.A. and Norrby, E.
Journal of Virology, 1990 64 5143-5148

2. Alansari, H. and Potgieter, N.D.
Virology, 1993 196 873-877

3. Beavis, R.C., Chaudhary, T. and Chait, B.T.
Organic Mass Spectrometry, 1992 27 156-158

4. Cane, P.A., Matthews, D.A. and Pringle, C.R.
Journal of General Virology, 1991 72 2091-2096

5. Cane, P.A. and Pringle, C.R.
Seminars in Virology, 1995 6 371-378

6. Chorev and Goodman
Acc. Chem. Res., 1993 26 266-273

7. Collins, P.L.
The Paramyxoviruses. Kingsbury, D.W. ed. New York: Plenum Press; 1991 103-161

8. Dore-Duffy, P. and Howe, C.
Proc. Soc. Exper. Biol. Med., 1978 157 622-625

9. Feldman, S.A. and Hendry, R.M.
Proceedings of the 13th Annual Meeting of the American Society for Virology, 1996

10. Garcia, O., Martin, M., Dopazo, J., Arbiza, J., Frabasile, S., Russi, J., Hortal, M., Perez-Brena, P., Martinez, I., Garcia-Barreno, B. and Melero, J.A.
Journal of Virology, 1994 68 5448-5459

11. Gorman, J.J., Corino, G.L. and Shiell, B.J.
Biomed. Environ. Mass Spectrometry, 1992 27 156-158

12. Gorman, J.J., Ferguson, B.L. and Nguyen, T.N.
Rapid Communications in Mass Spectrometry, 1996 10 (5) 529-536

13. Gruber, C. and Levine, S.
Journal of General Virology, 1983 64 825-832

14. Gruber, C. and Levine, S.
Journal of General Virology, 1985 66 417-432

15. Hall, C.B.
Science, 1994 265 1393-1394

16. Heilman, C.A.
Journal of Infectious Diseases, 1990 161 402-406

17. Henderson, F.W., Collier, A.M., Clyde Jr, W.A. and Denny, F.W.
New England Journal of Medicine, 1979 300 530

18. Johnson, P.R., Spriggs, M.K., Olmsted, R.A. and Collins, P.L.
Proc. Natl. Acad. Sci. USA, 1987 84 5625-5629

19. Kaufmann, R., Kirsch, D. and Spengler, B.
International Journal of Mass Spectrometry and Ion Processes, 1994 131 355-385

20. Kaufmann, R., Spengler, B. and Lutzenkirchen, F.
Rapid Communications in Mass Spectrometry, 1993 7 902-910

21. Kingsbury, D.W.
Virology. Fields, B.N., Knipe, D.M. ed. New York: Raven Press, 1990 945-962

22. Lambert, D.M.
Virology, 1988 164 458-466

23. Lambert, D.M. and Pons, M.W.
Virology, 1983 130 204-214

24. Lerch, R.A., Anderson, K. and Wertz, G.W.
Journal of Virology, 1990 64 5559-5569

25. Levin, M.J.
Journal of Paediatrics, 1994 124 S22-S27

26. Levine, S.
Journal of Virology, 1977 21 427-431

27. McIntosh, K. and Chanock, R.M.
in Virology: Fields, B.N. and Knipe, D.M. (eds) Raven Press NY (1990) 1;

28. Markwell, M.A.
The Paramyxoviruses. Kingsbury, D.W. ed. New York: Plenum Press; 1991 407-426

29. Morrison, T. and Portner, A.
The Paramyxoviruses. Kingsbury, D.W. ed. New York: Plenum Press; 1991 347-382

30. Murray, J., Selleck, P., Hooper, P., Hyatt, A., Gould, Gleeson, L., Westbury, H., Hiley, L., Selvey, L., Rodwell, B. and Ketterer, P.
Science, 1995 268 94-97

31. Norrby, E., Mufson, M.A., Alexander, H., Houghten, R.A. and Lerner, R.A.
Proc. Natl. Acad. Sci. USA, 1987 84 6572-6576

32. Olmstead, R.A., Murphy, B.R., Lawrence, L.A., Elango, N., Moss, B. and Collins, P.L.
Journal of Virology, 1989 63 (1) 411-420

33. Olson et al
J. Med. Chem., 1993 36 3039-3049

34. Roepstorff, P. and Fohlman, J.
Biomedical Mass Spectrometry, 1984 11 601

35. Rouse, J.C., Yu, W. and Martin, S.A.
Journal of American Society for Mass Spectrometry, 1995 6 822-835

36. Rueda, P., Garcia-Barreno, B. and Melero, J.A.
Virology, 1994 198 653-662

37. Satake, M., Coligan, J.E., Elango, N., Norrby, E. and Venkatesan, S.
Nucleic Acids Research, 1985 21 7795-7812

38. Simard, C., Nadon, F., Seguin, C. and Trudel, M.
Antiviral Research, 1995 28 303-315

39. Spengler, B., Kirsch, D., Kaufmann, R. and Jaeger, E.
Rapid Communications in Mass Spectrometry, 1992 6 105-108

40. Sullender, W.M., Anderson, K. and Wertz, G.W.
Virology, 1990 178 195-203

41. Sullender, W.M., Mufson, M.A., Anderson, L.A. and Wertz, G.W.
Journal of Virology, 1991 65 5425-5434

42. Sullender, W.M. and Wertz, G.W.
The Paramyxoviruses. Kingsbury, D.W. ed. New York: Plenum Press, 1991 383-406

43. Trudel, M., Nadon, F., Seguin, C. and Binz, H.
Virology, 1991 185 749-757

44. Walsh.
Journal of General Virology, 1984 65 761-767

45. Wertz, G.W., Collins, P.L., Huang, Y., Gruber, C., Levine, S. and Ball, L.A.
Proc. Natl. Acad. Sci. USA, 1985 82 4075-4079

BIOMOLECULAR RESEARCH INSTITUTE 5 June 1996



FIGURE 1

FIGURE 2

FIGURE 2 (cont.)

FIGURE 3A (x-axis: Time (min))

FIGURE 4A (y-axis: a.i.)

FIGURES 4B TO 4D (y-axis: a.i.)



Fragment 1     QNKPPSKPNNDFHFEVFNFVPCSICSNNPTCWAICK    4124.7/4107.7
Fragment 1R    QNKPPSKPNNDFHFEVFNFVPCSICSNNPTCWAICK    4128.7/4111.7
Fragment 2     NFVPCSICSNNPTCW AICK                    2115.49
Fragment 2R    NFVPCSICSNNPTCW                         1685.95
Fragment 3     NFVPCSICSNNPTCWAICK                     2097.47
Fragment 3R    NFVPCSICSNNPTCWAICK                     2101.51
Fragment 4     ICSNNPTCWA                              1107.26
Fragment 5     CSICSNNPTCWAICK                         1639.94
Fragment 6     CSICSNNPTCW AICK                        1658.0

FIGURE 5



FIGURE 6 (y-axis: a.i.; x-axis: m/z)

FIGURE 7 (y-axis: a.i.; x-axis: m/z)

FIGURE 8 (x-axis: m/z)

FIGURE 9A

FIGURES 9B & 9C (y-axis: a.i.; x-axis: m/z)

FIGURE 10



